Gemma 4 - work4ai

Gemma 4

https://gyazo.com/14bfbd89fe30b4163974727408679d93

https://blog.google/innovation-and-ai/technology/developers-tools/gemma-4/?utm_source=tw&utm_medium=social&utm_campaign=og&utm_content=&utm_term=Gemma 4: Byte for byte, the most capable open models

https://huggingface.co/blog/gemma4Welcome Gemma 4: Frontier multimodal intelligence on device

Gemma-3nと同様に、Gemma 4は画像、テキスト、音声入力をサポートし、テキスト応答を生成します。テキストデコーダはGemmaモデルに基づいており、長いコンテキストウィンドウをサポートしています。画像インコーダーはGemma 3のものに似ていますが、2つの重要な改良点があります。アスペクト比の可変と、速度、メモリ、画質の最適なバランスを見つけるための画像トークン入力数を調整可能です。すべてのモデルは画像(またはビデオ)とテキスト入力に対応しており、小型のバリアント(E2BおよびE4B)は音声もサポートしています。

model zoo

https://huggingface.co/google/gemma-4-31B-itgoogle/gemma-4-31B-it

https://huggingface.co/google/gemma-4-26B-A4B-itgoogle/gemma-4-26B-A4B-it

https://huggingface.co/google/gemma-4-E4B-itgoogle/gemma-4-E4B-it

https://huggingface.co/google/gemma-4-E2B-itgoogle/gemma-4-E2B-it

ライセンス

Apache 2.0

Gemma 3/Gemma 3n以前とは変更されている

← Gemma 3/Gemma 3n

#Google